Re-examining Google Tri-grams Measure (GTM) Sentence Similarity
نویسندگان
چکیده
منابع مشابه
Text Similarity Using Google Tri-grams
The purpose of this paper is to propose an unsupervised approach for measuring the similarity of texts that can compete with supervised approaches. Finding the inherent properties of similarity between texts using a corpus in the form of a word n-gram data set is competitive with other text similarity techniques in terms of performance and practicality. Experimental results on a standard data s...
متن کاملUsing Sentence Similarity Measure for Plagiarism Source Retrieval
This paper describes a method that was implemented in the software submitted to PAN 2014 competition for the source retrieval task. For generating queries we use the most important noun phrases and words of sentences selected from a given suspicious document. To download documents that are likely to be sources of plagiarism we employ a sentence similarity measure.
متن کاملSentence Extraction by Spreading Activation with Refined Similarity Measure
Although there has been a great deal of research on automatic summarization, most methods are based on a statistical approach, disregarding relationships between extracted textual segments. To ensure sentence connectivity, we propose a novel method to extract a set of comprehensible sentences that centers on several key points. This method generates a similarity network from documents with a le...
متن کاملA Link Grammar and Semantic Corpus Based Sentence Similarity Measure
A novel sentence similarity measure that based on grammar and semantic corpus is presented. The well-known problem in the field of semantic processing, such as natural language processing, QA systems, expert systems, search engines, etc., is trying to evaluate the semantic similarity between sentences or articles. The major challenge is to evaluate the similarity of sentence-vs.-sentence since ...
متن کاملSentence Similarity by Combining Explicit Semantic Analysis and Overlapping N-Grams
We propose a similarity measure between sentences which combines a knowledge-based measure, that is a lighter version of ESA (Explicit Semantic Analysis), and a distributional measure, Rouge.We used this hybrid measure with two French domain-orientated corpora collected from the Web and we compared its similarity scores to those of human judges. In both domains, ESA and Rouge perform better whe...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Knowledge Engineering
سال: 2017
ISSN: 2382-6185
DOI: 10.18178/ijke.2017.3.2.091